Accurate keyword spotting using strictly lexical fillers
نویسندگان
چکیده
Our goal is to design an accurate keyword spotter that can deal with any size of keyword set, since the size actually required in a wide range of applications is large (number of airports, number of names in a directory, etc.). This justi es the choice of an architecture based on a large-vocabulary continuous-speech recognizer. In a previous paper [1] we introduced the use of strictly-lexical subword llers for keyword spotting based on the INRS large-vocabulary continuous-speech recognizer [2] showing that they are, when compared to acoustic llers, a good compromise between memory and time consumption, keyword choice freedom and task-independence training on one hand and accuracy on the other hand. We propose here two new high-performance designs of individual strictly-lexical subword llers that perform, this time, better than their acoustic counterparts while still keeping the mentioned advantages.
منابع مشابه
New efficient fillers for unlimited word recognition and keyword spotting
This paper describes our complete results for improved lexical llers as well as two new kinds of llers, gives their results in unlimited speech recognition as well as for keyword spotting and compares them to the acoustic-phonetic ller in the case of keyword spotting. Tests have been conducted on di erent vocabularies derived from ATIS and the Wall Street Journal database. Results for keyword s...
متن کاملSpecific language modelling for new-word detection in continuous-speech recognition
The objective of this work is to allow the INRS continuous-speech recognizer to process accurately new words and incorporate them into the vocabulary. Until now only a few new-word detectors have been reported, all of them defining an acoustic filler model different from the models used to represent vocabulary words. In this paper, we define several designs using, unlike other researchers, stri...
متن کاملSubword Units for a Mandarin Keyword Spotting System
This paper is concerned with the problem of phonetic modeling in a Mandarin keyword spotting system. The task is to detect 20 keywords from continuous speech in the Call Home corpus from the Linguistic Data Consortium (LDC). Different speech units are explored, including whole word, syllable, and demi-syllable (INITIAL and FINAL). In our speaker-independent HMM-based Mandarin keyword spotting e...
متن کاملDocument Image Retrieval Based on Keyword Spotting Using Relevance Feedback
Keyword Spotting is a well-known method in document image retrieval. In this method, Search in document images is based on query word image. In this Paper, an approach for document image retrieval based on keyword spotting has been proposed. In proposed method, a framework using relevance feedback is presented. Relevance feedback, an interactive and efficient method is used in this paper to imp...
متن کاملLexical Access-based Confidence Measure for a Spanish Keyword Spotting System
Keyword spotting deals with the search of a reduced set of keywords in audio content. Phone Lattice-based approaches are very fast but achieve poor results. HMM-based keyword spotting systems deal with filler models to absorb the Out-of-vocabulary (OOV) words and achieve best results although they are slower. We propose a technique which combines them in order to perform a confidence measure to...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1997